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Abstract 

A fault-tolerant structure for a network is required to continue functioning fol- 
lowing the failure of some of the network's edges or vertices. This paper considers 
breadth-first search (BFS) spanning trees, and addresses the problem of designing 
a sparse fault-tolerant BFS tree, or FT-BFS tree for short, namely, a sparse sub- 
graph T of the given network G such that subsequent to the failure of a single edge 
or vertex, the surviving part T' of T still contains a BFS spanning tree for (the 
surviving part of) G. For a source node s, a target node t and an edge e E G, 
the shortest s — t path P s j,e that does not go through e is known as a replacement 
path. Thus, our FT-BFS tree contains the collection of all replacement paths P s j,e 
for every t E V(G) and every failed edge e E E(G). 

Our main results are as follows. We present an algorithm that for every n- vertex 
graph G and source node s constructs a (single edge failure) FT-BFS tree rooted at 
s with 0(n • min{Depth(s), ^Jn}) edges, where Depth(s) is the depth of the BFS 
tree rooted at s. This result is complemented by a matching lower bound, showing 
that there exist n- vertex graphs with a source node s for which any edge (or vertex) 
FT-BFS tree rooted at s has ft(n 3 / 2 ) edges. 

We then consider fault-tolerant multi-source BFS trees, or FT-MBFS trees for 
short, aiming to provide (following a failure) a BFS tree rooted at each source s E S 
for some subset of sources S CV. Again, tight bounds are provided, showing that 
there exists a poly-time algorithm that for every n-vertex graph and source set 
S C V of size a constructs a (single failure) FT-MBFS tree T*(S) from each source 
Si E 5, with 0{\fa • n 3 / 2 ) edges, and on the other hand there exist n-vertex graphs 
with source sets S C V of cardinality a, on which any FT-MBFS tree from S has 
^(v^ • n 3 / 2 ) edges. 
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Finally, we propose an 0(log n) approximation algorithm for constructing FT-BFS 
and FT-MBFS structures. The latter is complemented by a hardness result stating 
that there exists no fi(logn) approximation algorithm for these problems under 
standard complexity assumptions. In comparison with the randomized FT-BFS con- 
struction implicit in [14J, our algorithm is deterministic and may improve the num- 
ber of edges by a factor of up to ^Jn for some instances. All our algorithms can be 
extended to deal with one vertex failure as well, with the same performance. 

1 Introduction 

Background and motivation Modern day communication networks support a variety 
of logical structures and services, and depend on their undisrupted operation. As the 
vertices and edges of the network may occasionally fail or malfunction, it is desirable 
to make those structures robust against failures. Indeed, the problem of designing fault- 
tolerant constructions for various network structures and services has received considerable 
attention over the years. 

Fault-resilience can be introduced into the network in several different ways. This 
paper focuses on a notion of fault-tolerance whereby the structure at hand is augmented 
or "reinforced" (by adding to it various components) so that subsequent to the failure 
of some of the network's vertices or edges, the surviving part of the structure is still 
operational. As this reinforcement carries certain costs, it is desirable to minimize the 
number of added components. 

To illustrate this type of fault tolerance, let us consider the structure of graph k- 
spanners (cf. [TTJ, [EH [20]). A graph spanner H can be thought of as a skeleton structure 
that generalizes the concept of spanning trees and allows us to faithfully represent the 
underlying network using few edges, in the sense that for any two vertices of the network, 
the distance in the spanner is stretched by only a small factor. More formally, consider 
a weighted graph G and let k > 1 be an integer. Let dist(^,^,G) denote the (weighted) 
distance between u and v in G. Then a fc-spanner H satisfies that dist(^,^,i7) < k • 
dist(?i, v, G) for every u ) v G V. 

Towards introducing fault tolerance, we say that a subgraph H is an f-edge fault- 
tolerant k- spanner of G if dist(?x, v ) H \ F) < k • dist(^, v,G\ F) for any set F C E of 
size at most /, and any pair of vertices u, v G V. A similar definition applies to /-vertex 
fault-tolerant fc-spanners. Sparse fault-tolerant spanner constructions were presented in 

pen. 

This paper considers breadth-first search (BFS) spanning trees, and addresses the 
problem of designing fault-tolerant BFS trees, or FT-BFS trees for short. By this we mean 
a subgraph T of the given network G, such that subsequent to the failure of some of the 
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vertices or edges, the surviving part T f of T still contains a BFS spanning tree for the 
surviving part of G. We also consider a generalized structure referred to as & fault-tolerant 
multi-source BFS tree, or FT-MBFS tree for short, aiming to provide a BFS tree rooted at 
each source s G S for some subset of sources S CV. 

The notion of FT-BFS trees is closely related to the problem of constructing replacement 
paths and in particular to its single source variant, the single-source replacement paths 
problem, studied in [14]. That problem requires to compute the collection V s of all s — t 
replacement paths P s j,e for every t G V and every failed edge e that appears on the s — t 
shortest-path in G. The vast literature on replacement paths (cf. [ll[T4l[23l[25l[28]) focuses 
on time- efficient computation of the these paths as well as their efficient maintenance in 
data structures (a.k.a distance oracles). In contrast, the main concern in the current 
paper is with optimizing the size of the resulting fault tolerant structure that contains 
the collection V s of all replacement paths given a source node s. A typical motivation 
for such a setting is where the graph edges represent the channels of a communication 
network, and the system designer would like to purchase or lease a minimal collection of 
channels (i.e., a subgraph G f C G) that maintains its functionality as a "BFS tree" with 
respect to the source s upon any single edge or vertex failure in G. In such a context, the 
cost of computation at the preprocessing stage may often be negligible compared to the 
purchasing/leasing cost of the resulting structure. Hence, our key cost measure in this 
paper is the size of the fault tolerant structure, and our main goal is to achieve sparse (or 
compact) structures. 

Most previous work on sparse / compact fault-tolerant structures and services con- 
cerned structures that are distance-preserving (i.e., dealing with distances, shortest paths 
or shortest routes), global (i.e., centered on "all-pairs" variants), and approximate (i.e., 
settling for near optimal distances) , such as spanners, distance oracles and compact rout- 
ing schemes. The problem considered here, namely, the construction of FT-BFS trees, 
still concerns a distance preserving structure. However, it deviates from tradition with 
respect to the two other features, namely, it concerns a "single source" variant, and it 
insists on exact shortest paths. Hence our problem is on the one hand easier, yet on the 
other hand harder, than previously studied ones. Noting that in previous studies, the 
"cost" of adding fault-tolerance (in the relevant complexity measure) was often low (e.g., 
merely polylogarithmic in the graph size n), one might be tempted to conjecture that 
a similar phenomenon may reveal itself in our problem as well. Perhaps surprisingly, it 
turns out that our insistence on exact distances plays a dominant role and makes the 
problem significantly harder, outweighing our willingness to settle for a "single source" 
solution. 

Contributions We obtain the following results. In Sec. [2j we define the Minimum 
FT-BFS and Minimum FT-MBFS problems, aiming at finding the minimum such structures 
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tolerant against a single edge or vertex fault. Section [3] presents lower bound constructions 
for these problems. For the single source case, in Subsec. |3.1[ we present a lower bound 
stating that for every n there exists an n- vertex graph and a source node s C V for which 



any FT-MBFS tree from s requires Q(n 3 / 2 ) edges. In Subsec. 3.2, we then show that there 
exist n- vertex graphs with source sets S C V of size a, on which any FT-MBFS tree from 
the source set S has Vt(yfa • n 3 / 2 ) edges. 



These results are complemented by matching upper bounds. In Subsec. 4.1 



we 



present a simple algorithm that for every n- vertex graph G and source node s, constructs 
a (single edge failure) FT-BFS tree rooted at s with 0(n • min{Depth(<s), \/n}) edges. A 
similar algorithm yields an FT-BFS tree tolerant to one vertex failure, with the same size 
bound. In addition, for the multi source case, in Subsec. |4.2[ we show that there exists 
a polynomial time algorithm that for every n- vertex graph and source set S C V of size 
\S\ = a constructs a (single failure) FT-MBFS tree T*(S) from each source G 5, with 
0(y/a • n 3/2 ) edges. 

In Sec. [5| we show that the minimum FT-BFS problem is NP-hard and moreover, 
cannot be approximated (under standard complexity assumptions) to within a factor of 
f](logn), where n is the number of vertices of the input graph G. Note that while the 
algorithms of Sec. [4] match the worst-case lower bounds, they might still be far from 
optimal for certain instances, as illustrated in Sec. (6) Consequently, in Sec. [6| we 
complete the upper bound analysis by presenting an O(logn) approximation algorithm 
for the Minimum FT-MBFS problem. This approximation algorithm is superior in instances 
where the graph enjoys a sparse FT-MBFS tree, hence paying 0(n 3 / 2 ) edges (as does the 
algorithm of Sec. [4]) is wasteful. In light of the hardness result for these problems (of Sec. 
[5]), the approximability result is tight (up to constants). 



Related work To the best of our knowledge, this paper is the first to study the spar- 
sity of fault-tolerant BFS structures for graphs. The question of whether it is possible 
to construct a sparse fault tolerant spanner for an arbitrary undirected weighted graph, 
raised in [9j, was answered in the affirmative in [6], presenting algorithms for construct- 
ing an /-vertex fault tolerant (2k — l)-spanner of size 0(f 2 k^ 1 • n 1Jrl ^ k log 1-1 ^ n) and 
an /-edge fault tolerant 2k — 1 spanner of size 0(f • n 1+1 ^ h ) for a graph of size n. A 
randomized construction attaining an improved tradeoff for vertex fault-tolerant span- 
ners was shortly afterwards presented in [11], yielding (with high probability) for every 
graph G = (V,E) : odd integer s and integer /, an /-vertex fault-tolerant s-spanner 
/ s +i n logn) edges. This should be contrasted with the best stretch-size 
tradeoff currently known for non-fault-tolerant spanners [24], namely, 2k — 1 stretch with 
0{n 1Jrl/k ) edges. 

An efficient algorithm that given a set V of n points in d-dimensional Euclidean space 
constructs an /-vertex fault tolerant geometric (1 + e)-spanner for V namely, a sparse 
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graph H satisfying that dist(/u, v,H \ F) < (1 + e)dist(?x, G) for any set F C y of 
size / and any pair of points u,v G V \ F, was presented in [T5] . A fault tolerant 
geometric spanner of improved size was later presented in [15] ; finally, a fault tolerant 
geometric spanner of optimal maximum degree and total weight was presented in [9] . The 
distinction between the stronger type of fault-tolerance obtained for geometric graphs 
(termed rigid fault-tolerance) and the more flexible type required for handling general 
graphs (termed competitive fault-tolerance) is elaborated upon in [18] . 

A related network service is the distance oracle [3j [22], [25] , which is a succinct data 
structure capable of supporting efficient responses to distance queries on a weighted graph 
G. A distance query (s,£) requires finding, for a given pair of vertices s and t in V ) the 
distance (namely, the length of the shortest path) between u and v in G. The query 
protocol of an oracle S correctly answers distance queries on G. In & fault tolerant distance 
oracle, the query may include also a set F of failed edges or vertices (or both), and the 
oracle S must return, in response to a query F), the distance between s and t in 
G f = G \ F. Such a structure is sometimes called an F -sensitivity distance oracle. The 
focus is on both fast preprocessing time, fast query time and low space. It has been 
shown in [10] that given a directed weighted graph G of size n, it is possible to construct 
in time 0{mn 2 ) a 1-sensitivity fault tolerant distance oracle of size 0{n 2 logn) capable of 
answering distance queries in 0(1) time in the presence of a single failed edge or vertex. 
The preprocessing time was recently improved to O(mn), with unchanged size and query 
time [4J. A 2-sensitivity fault tolerant distance oracle of size 0(n 2 log 3 n), capable of 
answering 2-sensitivity queries in O(logn) time, was presented in [T2] . 

Recently, distance sensitivity oracles have been considered for weighted and directed 
graphs in the single source setting [14]. Specifically, Grandoni and Williams considered 
the problem of single-source replacement paths where one aims to compute the collection 
of all replacement paths for a given source node s, and proposed an efficient random- 
ized algorithm that does so in 0{APSP{n ) M)) where APSP(n, M) is the time required 
to compute all-pairs-shortest-paths in a weighted graph with integer weights [— M, M\. 
Interestingly, although their algorithm does not aim explicitly at minimizing the total 
number of edges used by the resulting collection of replacement paths, one can show that 
the resulting construction yields a rather sparse path collection, with at most 0(n 3 ^ 2 logn) 
edges (although it may also be far from optimal in some instances). 

Label-based fault-tolerant distance oracles for graphs of bounded clique-width are 
presented in [8]. The structure is composed of a label L(v) assigned to each vertex v ) and 
handles queries of the form (L(s), £(£), F) for a set of failures F. For an n- vertex graph 
of tree-width or clique-width k ) the constructed labels are of size 0(/c 2 log 2 n). 

A relaxed variant of distance oracles, in which distance queries are answered by ap- 
proximate distance estimates instead of exact ones, was introduced in [25] . where it was 
shown how to construct, for a given weighted undirected n- vert ex graph G, an approxi- 
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mate distance oracle of size 0(n 1+1 ^ k ) capable of answering distance queries in 0(k) time, 
where the stretch (multiplicative approximation factor) of the returned distances is at 
most 2k — 1. 

An /-sensitivity approximate distance oracle S was presented in [5j. For an integer 

1 ■ 8(/+l) 

parameter k > 1, the size of S is 0(kn fc + 2 (/+ 1 ) log (nW)), where W is the weight of the 
heaviest edge in G, the stretch of the returned distance is 2k — 1, and the query time is 
0(|F| • log 2 n • log log n • log log d), where is the distance between 5 and t in G\F. 

A fault-tolerant label-based (l+e)-approximate distance oracle for the family of graphs 
with doubling dimension bounded by a is presented in [2]. For an n- vertex graph G(V, E) 
in this family, and for desired precision parameter e > 0, the distance oracle constructs and 
stores an 0(logn/e 2a )-bit label at each vertex. Given the labels of two end- vertices s and 
t and of collections F v and F E of failed (or "forbidden") vertices and edges, the oracle 
computes, in time polynomial in the length of the labels, an estimate for the distance 
between s and t in the surviving graph G(V \ Fy, E \ F E ), which approximates the true 
distance by a factor of 1 + e. 

Our final example concerns fault tolerant routing schemes. A fault-tolerant routing 
protocol is a distributed algorithm that, for any set of failed edges F, enables any source 
vertex s to route a message to any destination vertex d along a shortest or near-shortest 
path in the surviving network G \ F in an efficient manner (and without knowing F in 
advance) . 

In addition to route efficiency, it is often desirable to optimize also the amount of 
memory stored in the routing tables of the vertices, possibly at the cost of lower route 
efficiency, giving rise to the problem of designing compact routing schemes (cf. [H fT71 

EH EH). 

Label-based fault-tolerant routing schemes for graphs of bounded clique-width are 
presented in [8]. To route from s to £, the source needs to specify the labels L(s) and 
L(t) and the set of failures F, and the scheme efficiently calculates the shortest path 
between s and t that avoids F. For an n- vertex graph of tree- width or clique- width k ) 
the constructed labels are of size 0{k 2 log 2 n). 

Fault-tolerant compact routing schemes are considered in [5j, for up to two edge fail- 
ures. Given a message M destined to t at a source vertex 5, in the presence of a failed edge 
set F of size \F\ < 2 (unknown to 5), the scheme presented therein routes M from s to t in 
a distributed manner, over a path of length at most O(k) times the length of the optimal 
path (avoiding F). The total amount of information stored in vertices of G on average is 
bounded by 0(kn 1+1 ^ k ). This should be compared with the best memory-stretch tradeoff 
currently known for non-fault-tolerant compact routing [24], namely, 2k — 1 stretch with 
0(n 1+1 / k ) memory per vertex. 

A compact routing scheme capable of handling multiple edge failures is presented 
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in [7j. The scheme routes messages (provided their source s and destination t are still 
connected in the surviving graph G\F) over a path whose length is proportional to the 
distance between s and t in G \ F, to |F| 3 and to some poly-log factor. The routing table 
required at a node v is of size proportional to v's degree and some poly-log factor. 

A routing scheme with stretch 1 + e for graphs of bounded doubling dimension is also 
presented in [2j. The scheme can be generalized also to the family of weighted graphs of 
bounded doubling dimension and bounded degree. In this case, the label size will also 
depend linearly on the maximum vertex degree A, and this is shown to be necessary. 

2 Preliminaries 

Notation Given a graph G = (V, E) and a source node s, let Tq(s) CGbea shortest 
paths (or BFS) tree rooted at s. For a source node set S C V , let T (S) = {J seS T (s) be 
a union of the single source BFS trees. Let 7r(s, v,T) be the s — v shortest-path in tree 
T, when the tree T = T (s), we may omit it and simply write tt(s,v). Let T(v,G) be 
the set of v neighbors in G. Let E{y,G) = {(u,v) G E(G)} be the set of edges incident 
to v in the graph G and let deg(v,G) = \E{y,G)\ denote the degree of node v in G. 
When the graph G is clear from the context, we may omit it and simply write deg(^). 
Let depth(s, = dist(s, v,G) denote the depth of v in the BFS tree T (s). When the 
source node s is clear from the context, we may omit it and simply write depth(^). Let 
Depth(s) = max wG y{depth(s, u)} be the depth of T (s). For a subgraph G' = (V\ E') C G 
(where V C V and E' C E) and a pair of nodes u,v G V, let dist(^, v : G') denote the 
shortest-path distance in edges between u and v in G r . For a path P = [v\, . . . , Vk] ) let 
LastE(P) be the last edge of path P. Let \P\ denote the length of the path and P[v i) vj] 
be the subpath of P from V{ to Vj. For paths Pi and P 2j P\° P2 denote the path obtained 
by concatenating P 2 to P\. Assuming an edge weight function W : E{G) —> M + , let 
SP(s, Vi, G, W) be the set of s — vi shortest-paths in G according to the edge weights of 
W. Throughout, the edges of these paths are considered to be directed away from the 
source node s. Given ans-u path P and an edge e = (x, y) G P, let dist(s, e, P) be the 
distance (in edges) between s and e on P. In addition, for an edge e = (x,y) G Tq(s), 
define dist(s, e) = i if depth(x) = i — 1 and depth(y) = i. 

Definition 2.1 A graph T* is an edge (resp., vertex) FT-BFS tree for G with respect to 
a source node s G V , iff for every edge f G E{G) (resp., vertex f G V) and for every 
veV, dist(s, v, T* \ {/}) = dist(s, v, G \ {/}). 

A graph T* is an edge (resp., vertex) FT-MBFS tree for G with respect to source set 
S C V, iff for every edge f G E{G) (resp., vertex f GV ) and for every s G S and v G V , 
dist(s, v, T* \ {/}) = dist(s, v, G \ {/}). 
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To avoid cumbersome notation, we refer to edge FT-BFS (resp., edge FT-MBFS) trees 
simply by FT-BFS (resp., FT-MBFS) trees. Throughout, we focus on edge fault, yet the 
entire analysis extends trivially to the case of vertex fault as well. 



The Minimum FT-BFS problem Denote the set of solutions for the instance (G, s) by 
TO, G) = {f C G | f is an FT-BFS tree w.r.t. s}. Let Cost*(s, G) = mm{\E(T )| | f G 
T(s, G)} be the minimum number of edges in any FT-BFS subgraph of G. These definitions 
naturally extend to the multi-source case where we are given a source set S C V of size 
a. Then 

T(S, G) = {TQG\T is a FT-MBFS with respect to S} 

and 

Cost*(S,G9 = min{|£7(f)| | feT(S,G)}. 

In the Minimum FT-BFS problem we are given a graph G and a source node s and 
the goal is to compute an FT-BFS T 6 T(s, G) of minimum size, i.e., such that \E{T)\ = 
Cost* (5, G). Similarly, in the Minimum FT-MBFS problem we are given a graph G and a 
source node set S and the goal is to compute an FT-MBFS T 6 T{S ) G) of minimum size 
i.e., such that \E(f)\ = Cost* (5, G). 



3 Lower Bounds 



In this section we establish lower bounds on the size of the FT-BFS and FT-MBFS structures. 



In Subsec. |3.1| we consider the single source case and in Subsec. |3.2| we consider the case 
of multiple sources. 



3.1 Single Source 

We begin with a lower bound for the case of a single source. 

Theorem 3.1 There exists an n-vertex graph G(V, E) and a source node s £ V such that 
any FT-BFS tree rooted at s has Q(n 3/2 ) edges, i.e., Cost*(s, G) = Q(n 3/2 ). 

Proof: Let us first describe the structure of the graph G = (V, E). Set d = LvW^J- 

The graph consists of four main components. The first is a path n = [s — . . . , Vd+i = 
v*] of length d. The second component consists of a node set Z = {24, . . . , zj] and a collec- 
tion of d disjoint paths of deceasing length, Pi, ... , where Pj — [vj = p[, . . . , Zj = p J tj ] 
connects vj with Zj and its length is tj = \Pj\ = 6 + 2{d — j), for every j £ 1, • • • , d. 
Altogether, the set of nodes in these paths, Q = Uj=i ^ s °f s ^ ze \Q\ = d 2 + 7d. The 
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third component is a set of nodes X of size n — (d 2 + 7d), all connected to the terminal 
node v*. The last component is a complete bipartite graph B = (X, Z, E) connecting X 
to Z. Overall, V = XuQzmdE = EU E(tt) U |jj = i ^(^)- Note that rc/4 < |Q| < rc/2 
for sufficiently large n. Consequently, \X\ = n — \Q\ > n/2, and |P| = \Q\ • \X\ > n 3 / 2 /4. 




Figure 1: Lower bound construction for FT-BFSThe original BFS tree consists of the non- 
dashed edges. The dashed edges are the ones necessary to make it an FT-BFS tree. For 
example, the bold dashed edge (xj, Zi) is required upon failure of the edge e*. 

A BFS tree T rooted at s for this G (illustrated by the solid edges in Fig. [j]) is given 

by 

d 

E(T ) = {(*^)|*e{i,...,4}uU^ 

3=1 

where £j = tj — (d — j) for every j G {1, . . . , d}. We now show that every FT-BFS tree 
T f G T{s ) G) must contain all the edges of P, namely, the edges e^- = (x^Zj) for every 
i G {1, . . . , \X\} and j G {1, . . . , d} (the dashed edges in Figure [j]). Assume, towards 
contradiction, that there exists a T' G T(s, G) that does not contain e^- (the bold dashed 
edge (x i: Zj) in the figure), (the bold dashed edge (x i: zj) in Figure [I]). Note that upon the 
failure of the edge ej = (vj,Vj+i) G 7r, the unique s — Xi shortest-path connecting s and 
Xi in G \ {ej} is P- = 7r[vi, vj] o P, o [^-, x^], and all other alternatives are strictly longer. 
Since £ T' also P- ^ T', and therefore dist(s, x^ G \ {ej}) < dist(s, x^ T' \ {e^}), in 
contradiction to the fact that T f is an FT-BFS tree. It follows that every FT-BFS tree T' 
must contain at least \E\ = Q(n 3 / 2 ) edges. The theorem follows. | 
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3.2 Multiple Sources 



We next consider an intermediate setting where it is necessary to construct a fault-tolerant 
subgraph FT-MBFS containing several FT-BFS trees in parallel, one for each source sGS, 
for some S CV. We establish the following. 

Theorem 3.2 There exists an n-vertex graph G{V ) E) and a source set S C V of cardi- 



nality a, such that any FT-MBFS tree from the source set S has Vt{^ 
Cost*(S, G) = ^(v^ • n 3/2 ). 



TV 



3/2^ 



edges, i.e., 



Proof: Our construction is based on the graph G(d) = (Vi,£i), which consists of three 
components: (1) a set of vertices U = {^i, . . . , Ud} connected by a path Pi = [u\ ) . . . ) Ud\ ) 
(2) a set of terminal vertices Z = {zi, . . . , zj} (viewed by convention as ordered from left 
to right), and (3) a collection of d vertex disjoint paths Qi of length \Qi\ = 6 + 2{d — i) 
connecting U{ and Zi for every i G Thus \Q\\ > ... > \Q\\. The vertex 

r(G(d)) = Ud is fixed as the root of G{d) ) hence the edges of the paths Qi are viewed 
as directed away from and the terminal vertices of Z are viewed as the leaves of the 
graph, denoted Leaf (G(d)) = Z. See Fig. [2] for illustration. 



©00© 



Figure 2: The graph G\(d). 

Overall, the vertex and edge sets of G{d) are V\ — U U Z U Uf=i V(Qi) an d ^i 
E(Pi)U[jUE(Qi)- 

Observation 3.3 (a) The number of leaves in G{d) is |Leaf (G(d))\ = d. 
(b) \V\\ = c • d 2 for some constant c. 
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3.3 



Take a copies, G[, . . . , G f al of G(d), where d = 0((n/ a) 1 / 2 ). Note that Obs. 
each copy G\ consists of 0(n/a) nodes. Let yi be the node Ud and Si = r(G f i ) in the 
ith copy G\. Add a node v* connected to a set X of Q(n) nodes and connect v* to 
each of the nodes for i G {1, . . . Finally, connect the set X to the a leaf sets 
Leaf (Gi), . . . , Leaf (G f a ) by a complete bipartite graph, adjusting the size of the set X 
in the construction so that = n. Since nLeaf {G[) = ^((n/a) 1 / 2 ) (see Obs. 3.3), 

overall \E(G)\ = Q(n • a • nLeaf (Gi(d))) = (an) 1 / 2 ). Since the path from each source 
Si to X cannot aid the nodes of G'- for j ' ^ z, the analysis of the single-source case can be 
applied to show that each of the bipartite graph edges in necessary upon a certain edge 
fault. See Fig. [3] for an illustration. | 




Figure 3: Illustration of the lower bound for the multi-source case. 



4 Upper Bounds 

In this section we provide tight matching upper bounds to the lower bounds presented in 
Sec. 1 



4.1 Single Source 

For the case of FT-BFS trees, we establish the following. 
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Theorem 4.1 There exists an 0{nm) time algorithm that for every n-vertex graph G 
and source node s constructs an FT-BFS tree rooted at s with 0(n • min{Depth(<s), y^}) 
edges. 

To prove the theorem, we first describe a simple algorithm for the problem and then prove 
its correctness and analyze the size of the resulting FT-BFS tree. We note that using the 
sparsity lemma of [23] and the tools of [14] , one can provide a randomized construction for 
an FT-BFS tree with 0(n 3 / 2 logn) edges with high probability. In contrast, the algorithm 
presented in this paper is deterministic and achieve an FT-BFS tree with 0(n 3 / 2 ) edges, 
matching exactly the lower bound established in Sec. |3| 

The Algorithm To avoid complications due to shortest-paths of the same length, we 
assume all shortest-path are computed with a weight assignment W that guarantees the 
uniqueness of the shortest-paths. This can be achieved by considering a weight function 
W defined so as to ensure that the shortest paths are also of minimal number of edges but 
at the same time guarantees the uniqueness of the u — v shortest-path, for every u, v G V. 
Let ei, . . . , e m be some arbitrary ordering of E(G). Then set Wie^) = 2 m+1 + 2 k . Let 
T = BFS(s, G) be the BFS tree rooted at s in G, computed according to the weight 
assignment W. For every ej G T , let T (ej) be the BFS tree rooted at s in G \ {ej}. 
Then the final FT-BFS tree is given by 

T*(s)=T U |J T ( ej ). 

The correctness is immediate by construction. 
Observation 4.2 T*(s) is an FT-BFS tree. 

Proof: Consider a vertex v and an edge e. If e ^ 7r(s, v), then 7r(s, v) C X*(s) \{e}, hence 
dist(s,v,r*(s) \{e}) = \tt(s,v)\ = dist(s, v, G \ {e}). Otherwise, e G n(s,v) C T . Then 
by construction, T (e) C T*(s). By definition, dist(s, v, T*(s) \ {e}) = dist(s, v, T (e)) = 
dist(s, v,G\ {e}). The observation follows. | 

Due to [26] each of the n — 1 BFS trees T (ej) can be constructed in 0{m) time, hence 
0(nm) rounds are required in total. It therefore remains to bound the size of T*(s). 

Size Analysis We first provide some notation. For a path P, let Cost(P) = ^2 eeP W(e) 
be the weighted cost of P, i.e., the sum of its edge weights. An edge e G G is defined as 
new if e £ E(T ). For every v { G V and ej G T , let P*j = 7r(s, v h T (ej)) G SP(s, v h G \ 
{ej},Vl^) be the optimal replacement path of s and ^ upon the failure of ej G T . Let 
New(P) = E{P) \ E(T ) and 

New(^) = {LastE(^) | e 3 G T } \ P(T ) 



12 



be the set of V{ new edges appearing as the last edge in the replacement paths P*j of V{ 
and ej G Tq. It is convenient to view the edges of To(e^) as directed away from s. We 
then have that 

T* = r U |J New(^). 

v t ev\{s} 

I.e., the set of new edges that participate in the final FT-BFS tree T* are those that appear 
as a last edge in some replacement path. 

We now upper bound the size of the FT-BFS tree T*. Our goal is to prove that New(^) 
contains at most 0(y/n) edges for every Vi G V. The following observation is crucial in 
this context. 

Observation 4.3 If LastE(P* i ) ^ E(T Q ), then ej G 7r(s,^). 

Proof: Assume, towards contradiction, that ej £ 7r(s, Vi) and let P*^ C T (ej) be the s— ^ 
replacement path in G \ {ej} according to the weight assignment W. Since LastE(P*^) ^ 
E(Tq), we have two different s — Vi shortest paths in G \ {ej}, namely, it(s,Vi) and 
P*j. By the optimality of ir(s,Vi) in G, i.e., 7r(«s,^) G SP(s, G, W), it holds that 
Cost(7r(s, m)) < Cost(P*j). On the other hand, by the optimality of P*j in G \ {e^}, i.e., 
P*j G SP(s,Vi,G\{ej},W), we have that Cost(7r(s, u)) > Cost(P^). Contradiction. | 

Obs. |4.3| also yields the following. 

Corollary 4.4 ^New(^) = {LastE(i^) | ej G ir(s^Vi)} \ E(Tq) and 
(2) |New(v»)| < min{depth(^),deg(^)}. 

This holds since the edges of New(^) are coming from at most depth(^) replacement 
paths P*j (one for every ej G 7r(s, ?;$)), and each such path contributes at most one edge 
incident to V{. 

For the reminder of the analysis, let us focus on one specific node u = Vi and let 
7r = 7r(s, u), N = |New(it)|. For every edge e& G New(^), we define the following parameters. 
Let f(ek) G n be the failed edge such that G T (/(e^)) appears in the replacement path 
Pk = 7v(s,u,T f ) for T" = To(/(ejfe)). (Note that might appear as the last edge on the 
path 7r(s, ia, ^(e')) for several edges e' G 7r; in this case, one such e' is chosen arbitrarily). 

Let bk be the last divergence point of Pk and 7r, i.e., the last vertex on the replacement 
path Pk that belongs to V(tt) \ {u}. Since LastE(P^) ^ E(T ), it holds that bk is not the 
neighbor of u in P&. 

Let New(?i) = {ei, . . . , e^} be sorted in non-decreasing order of the distance between 
bk and u ) dist(&&, u, tt) = \7r(bk,u)\. I.e., 

dist(6i, n) < dist(6 2 , u, n) . . . < dist(6^, tt). (1) 

We consider the set of truncated paths P' h = Pk[bk,u] and show that these paths are 
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vertex-disjoint except for the last common endpoint u. We then use this fact to bound 
the number of these paths, hence bound the number TV of new edges. The following 
observation follows immediately by the definition of b k . 

Observation 4.5 (V(P^) n V(tt)) \ {b k) u} = 0. 

Lemma 4.6 (V(Pf) n V(Pf)) \ {u} = for every ij e {1, . . . , N}, i ^ j. 
Proof: Assume towards contradiction that there exist i ^ j, and a node 

u> e (V(p;) n V(p;)) \ {u} 



in the intersection. Since LastE(P/) ^ LastE(P-), by Obs. 4.5 we have that P[,P\ Q 



G \ E(tt). The faulty edges f{e i ) 1 f{ej) belong to E(tt). Hence there are two distinct 
u' — u shortest paths in G \ {/(e*), f(ej)}. By the optimality of P[ in T (/(e^)), (i.e., 
Pi e SP(s, u, G\{f{e i )} ) W)), we have that Cost^'^', u\) < Cost(Pj[^ / , u]). In addition, 
by the optimality of P- in T (f(ej)), (i.e., Pj G SP(s,u,G \ {f(ej)},W)), we have that 
Cost (P-[u f ,u\) < Cost(P! [u f ,u]). Contradiction. | 

We are now ready to prove our key lemma. 
Lemma 4.7 |New(^)| = 0(n 1 / 2 ) for every u e V . 



4.6 



we 



Proof: Assume towards contradiction that N = |New(iz)| > \/2n. By Lemma 
have that 6 1? . . . , b^ are distinct and by definition they all appear on the path tt. There- 
fore, by the ordering of the P kl we have that the inequalities of Eq. ([!]) are strict, i.e., 
dist(fri, u, 7r) < dist(6 2 , u, n) < ... < dist(&jv, u, tt). Since b\ ^ u (by definition), we also 
have that dist(&i, u, tt) > 1. We Conclude that 

dist(&fc,M,7r) = \7v(b k ,u)\ > k . (2) 

Next, note that each P k is a replacement b k — u path and hence it cannot be shorter than 
7r(bk,u), implying that \P' k \ > \7r(bk,u)\. Combining, with Eq. ([2]), we have that 

\P' k \ > k for every k e {1, . . . , N} . (3) 



Since by Lemma 4.6, the paths P k are vertex disjoint (except for the common vertex u), 
we have that 



N 



{J{V{P' k )\{u}) 



N N 
k=l k=l 

where the first inequality follows by Eq. ^ and the last inequality by the assumption 
that N > y/2n. Since there are a total of n nodes in G, we end with contradiction. | 



k=i 



Turning to the case of a single vertex failure, the entire proof goes through almost 
without change, yielding the following. 
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Theorem 4.8 There exists a polynomial time algorithm that for every n-vertex graph 
and source node s constructs an FT-BFS tree from s tolerant to one vertex failure, with 
0(n • min{Depth(s), y^}) edges. 



4.2 Multiple Sources 

For the case of multiple sources, we establish the following upper bound. 

Theorem 4.9 There exists a polynomial time algorithm that for every n-vertex graph 
G = (V,E) and source set S C V of size \S\ = a constructs an FT-MBFS tree T*(S) 
from each source Si G S, tolerant to one edge or vertex failure, with a total number of 
n • min{^ s . GS , depth(^), 0(^/arl)} edges. 



The algorithm As in the single source case, to avoid complications due to shortest- 
paths of the same length, all shortest path distances in G are computed using a weight 
function W defined so as to ensure the uniqueness of a single u — v shortest-path. For 
every G S and every ej G To(^), let T(s^ ej) be the BFS tree rooted at Si in G \ {ej}. 
Let 

US) = |J ToM 

be the joint structure containing all the BFS trees of S. Then by the previous section, 
the FT-BFS tree for s« is T*(s;) = T U \Je 3 eT ( Si ) T ( s ii e i)- Define the FT-MBFS for S as 

T*(S)=\jT*( Si )= |J T( Si ,ej). 

s t eS Si(E s,ej(ETo(si) 



Analysis The correctness follows immediately by the single source case. It remains to 
bound the number of edges of T*(S). An edge e is new if e ^ T (S). For every vi G V 
define its new edge set in the graph T*(S) by 

New(5,Vi) = {LastE(7r(s,i; < ,r(si,e i ))) I s t e S, ej e T ( Sl )} \ E(T (S)). 

To bound the size of T*(S), we focus on node u = and bound its new edges New^, u) = 
{ei, . . . , ejy}. Obs. 4.3 yields the following. 

Corollary 4.10 New(S» < ^ s . €5 depth(^). 

Towards the end of this section, we prove that New(S f , u) contains at most O(y / on) new 
edges. For ease of notation, let 7r(^) = 7r(^, u) for every i G {1, . . . , a}. For every edge 
ek G New(5, u) ) we define the following parameters. Let s(ek) G 5 and /(e^) G To(s(efc)) 
be such that G T(s(ek), /(e^)). I.e., the edge e& appears in the replacement s(efc) — 
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path Pfc = 7r(s,w, T"), where T" = T(s(ejfe), f(e>k)) is the BFS tree rooted at s(efc) in 



G \ {/(ejfe)}. By Obs. 4.3, /(e^) G 7r(s(efc)). (Note that for a given new edge there 
might be several s' and e' such that = LastE(7r(V, u, T(s', e'))); in this case one such 
pair s',e' is chosen arbitrarily.) For every replacement path (whose last edge is e*.), 
denote by 6^ the Zast divergence point of P]^ and the collection of shortest Si — u paths 
V = {J Si es 7r ( s ii u ) \ i u }- -'-• e -' is the last point on P^ that belongs to V(V) \ {u}. Let 
P' k = Pk[bk,u] be the truncated path from the divergence point bk to u. Note that since 
e = (x, u) = LastE(P/e) ^ E(Tq(S)) is a new edge, it holds that x £ V(V) \ {u} and bk is 
in V \ {u}. The following observation is useful. 

Observation 4.11 F fe CG\ E(V) for every k G {1, . . . , N}. 

We now show that the paths P f k are vertex disjoint except for their endpoint u (this is 
regardless of their respective source s(efc)). 

Lemma 4.12 (V(P[) n ^(Pj)) \ M = /or even/ i ^ j G {1, . . . , N}. 
Proof: Assume towards contradiction that there exists i ^ j, and a node 

^(m')n^))\W 



in the intersection. Since LastE(i^) ^ LastE(Pj) and by Obs. 4.11, P[, P- CG\ E(V), 
and the faulty edges /(e^), / (e^) G P, we have two distinct u' — u replacement paths in 
G \ {/(eO, /(e,)}. By the optimality of i* in T(s(e,), /(e,)), (i.e., G SP(s(eO, G \ 
{f( e i)}> W)), we have that Cost(i^) < Cost(Pj). Similarly, by the optimality of P- 
in T{s{e j )J{e j )) ) (i.e., Pj G SP{s{e j ) ) u ) G \ {/(e,)}, VP)), we have that Cost(Pj) < 
Cost(P/), contradiction. The lemma follows. | 

We are now ready to state and prove our main lemma. 
Lemma 4.13 N = |New(5»| = 0(y/an). 

We begin by classifying the set of new edges G New^, u) into a classes according to the 
position of the divergence point b{. For every G New(5, u), let s(ef) G 5 be some source 
node such that the divergence point bi G 7r(^(e^), appears on its ^(e^) — shortest path 
T (S). If there are several such sources for the edge e*, one is chosen arbitrarily. 

For every Sj G S, let 

New(<Sj) = {e^ G New(5, tx) | s"(e^) = Sj} 

be the set of new edges in New(S f , iz) that are mapped to Sj G 5. Then, New(S f , u) = 
U Si G5 New ( 5 i)- Let x j = l New ( s j)l- 

We now focus on sj. For every e Jfc G New(sj), = {1, . . . , xj}, let Pj k = 7r(<s(e Jfc ), ?x, T f ) 
for X' = T(s(ej k ), f(ej k )) be the replacement path such that LastE(P Jfc ) = e Jfc and bj k 
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be its corresponding (last) divergence point with 7r(sj,u) (sj = ^(e Jfc )). In addition, the 
truncated path is given by P r - k = Pj k [bj k , u] . Note that LastE(P Jfc ) = e Jfc . 

Consider the set of divergence points bj 1 , . . . , bj sorted in non-decreasing order of the 
distance between bj k and Sj on the shortest Sj — u path ir(sj) i.e., \^{bj k) u, T (<Sj))|, where 

1^(6^,^X0^))! < \7r(b j2 ,u,T (sj))\... < \7T(bj u,T ( Sj ))\ . (4) 



Note that by Lemma 4.12, bj e ^ bj , for every t,t! £ {1, . . . , Xj}. In addition, since each 
bj £ 7^ u, \7r(bj 1 ,u,T (sj))\ > 1. Hence, since 6^, . . . ,bj x . G 7r(sj), combining with Eq. (j3J) 
we get that 

1 < 1^,^X0^))! < Wb j2 ,u,T (s j ))\...< |7r(6 % ,^T (^))| • (5) 

Since Pj £ is an alternative bj £ — u replacement path, we have that 

\P' k \ > \n(b h ,u,T ( Sj ))\>£. (6) 

where the last inequality follows by Eq. Hence, since all Pj £ are vertex disjoint, except 
for the last node u, we get the total number of nodes V(sj) = [J V(Pj e ) \ {u} occupied by 
Pj £ paths is 

ETOi = i%)i = o(^ 

£=1 

Since the nodes of V(sj 1 ) and V(sj 2 ) are disjoint for every Sj 1 ,Sj 2 G S, by Lemma 
412[ it follows that |New(S»| = EJ=i xj but EJ=il^( 5 j)l = °(^ 2 ) < n - Therefore, 
|New(5,tz)|=EJ=i^<0(vM- I 



As there are n nodes, combining with Cor. |4.10[ we get that the total number of edges 
in T*(S) is given by 

E(T*(S)) < \E(T (S))\ + J2\Tiev(S,u)\ < an + n • min{^ depth(^), 0(V™)}, 



as required. Thm. 4.9 is established. The analysis for the case of vertex faults follows 



with almost no changes. | 

5 Hardness of Approximation of the Minimum FT-BFS 
Problem 

In this section we establish the following. 

Theorem 5.1 The Minimum FT-BFS problem is NP-complete and cannot be approxi- 
mated to within a factor clog n for some constant c> unless MV C TlM£(n polyl ° s{n) ). 
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We prove Theorem |5.1| by showing a gap preserving reduction from the Set-Cover problem 
to the Minimum FT-BFS problem. An instance (U,$) of the Set-Cover problem consists 
of a set of TV elements U = {u\, . . . , u^} and a collection of M sets $ = {Si, . . . , Sm} 
such that Si C U and |J Si — U. The task is to choose the minimal number of sets in 
# whose union covers all of U. Fiege [13] showed that the Set Cover problem cannot be 
approximated within a ratio of (1 - o(l)) Inn unless MV C TlM£(M polyl °^). 

The Transformation. Given a Set-Cover instance ([/, we construct a Minimum 
FT-BFS instance Z(E7, 30 = (G, 5) as follows. Let X = {xi, . . . , %} (resp., Z = {zi, . . . , Zjy}) 
be the vertex set corresponding to the collection of sets $ (resp., elements of U). Let 
Bxz = (X, Z,E X z) be the bipartite graph corresponding to the input where 
E X z = {(xj,Zi) I Ui G j G {1, . . . ,M} and i G {1,...,X}}. Embed the bi- 
partite graph B X z in G in the following manner. Construct a length-(X + 1) path 
P — [s — poiPi • • • ,Ptv,Pn+iL connect a vertex to and connect a set of ver- 
tices Y = {7/1, . . . ,yn} for R = 0((MN) 3 ) to the vertex Pn+i by the edges of E pY = 
{(pN+iiVi) I i £ {lj---j-R}}- Connect these vertices to the bipartite graph B X z as 
follows. For every i G {1,...,X}, connect the node pi_i of P to the node 2^ of Z 
by a path Qi = \pi-i = q l ,...,q l t . = zi\ where U = \Qi\ = 6 + 2(N — i). Thus the 
paths Qi are monotonely decreasing and vertex disjoint. In addition, connect the vertices 
v' and Pn+i to every vertex of X, adding the edge sets E vX = {(V,x^) | Xi G X} 
and E pX = {(pjv+ij^j) | £j G X}. Finally, construct a complete bipartite graph 
Bxy = {X ) Y ) E X y) where E X y = {(yi^Xj) \ xj G X, yn G y}. This completes the 
description of G. For illustration, see Fig. [4j Overall, 

N 

v(G) = xuzu v(P) u |J y (q^ u {*/} u y, 

?;=i 

and 

A/" 

E(G) = E xz U £(P) U U E(Qi) U {(pjv, v')} U S py U £ vX U E pX U 

?;=i 

Note that \V(G)\ = O(R) and that \E(G)\ = 0(\E XZ \ + X 2 + Mi?) = O(ATR). 
First, note the following. 

Observation 5.2 Upon the failure of the edge = (pi-i,pi), i G {1, . . . , X} ; i/ie follow- 
ing happen: 

(a) the unique s — Zi shortest path in G\ {e^} is given by Pi = P[s,p^_i] o Q { . 

(b) the shortest-paths connecting s and the vertices of {pniPn+IiV 1 } U X UY disconnect 
and hence the replacement paths in G\ {e^} must go through the Z nodes. 
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Figure 4: Schematic illustration of the reduction from Set-Cover to Minimum FT-BFS. In 
this example # = {Si, S 2l . . . , £5} where Si = {^i, n 4 }, S 2 = {^1, ^3}, £3 = {« 2 , a 4 }, 
S4 = {^3} and S5 = {0,1,0,4}. Thus, N = 4 and M — 5. The minimal vertex cover is 
given by £2 and S3. The vertex set Y is fully connected to X. In the optimal FT-BFS 
T*, y is required to be connected to the xj nodes that corresponds to the sets appearing 
in the optimal cover. For example, yi is connected to x 2 and x 3 which "covers" the Z 
nodes. The red edges are necessary upon the fault of e 3 . All edges described except for 
the (xj,yt) edges are required in any FT-BFS tree. 



We begin by observing that all edges except those of B X y are necessary in every 
FT-BFS tree f G T(s, G). Let E = E{G) \ E XY . 

Observation 5.3 E C f for every f G T(s, G). 

Proof: The edges of the paths P and the edges of E pY U {(pn,v')} are trivially part of 
every FT-BFS tree. The edges of the path Qi are necessary, by Obs. 5.2[ a), upon the 



failure of for every i G {1, . . . , N}. To see that the edges of E vX are necessary, note 
that upon the failure of the edge (pn,Pn+i) ot the edge (p^+i?^)? the unique s — ou- 
tplacement path goes through v f for every j G {1, . . . , M}. Similarly, the edges E pX are 
necessary upon the failure of (pn->v') or (V,Xj). 

It remains to consider the edges of E xz . Assume, towards contradiction, that there 
exists some T f G T(s, G) that does not contain = (xj, Zj) G E xz . Note that by Obs. 
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|5.2| (a), upon the failure of the edge = (pi-i,Pi) G P, the unique 5 — xj shortest-path 
in G \ {ei} is P/ = 7r[pojPi-i] ° Qi° [zi, Xj], and all other alternatives are strictly longer. 
Since ^ T', also P/ ^ T', and therefore dist(s, Xj, G \ {ei}) < dist(s, x^ T' \ {e^}), in 
contradiction to the fact that T' G T(s, G). The observation follows. | 

We now prove the correctness of the reduction and then consider gap-preservation. 
Let T G T{s ) G) and define by T(y^T) = {xj \ (x^yt) G T} the X nodes that are 
connected to yi in T, for every G Y\ Let ft(T) = min{|r(y^,T)| | yt G F}. Note that 
since the edges of E are necessary in every T G T(s, G) it follows that 

|P(f)| > |P| + /<f).P. (7) 

Lemma 5.4 if T G T(s, G) i/ien there exists a Set-Cover for (U,$) of size at most k(T). 

Proof: Consider f G T(s, G) and let ^ G F be such that |r(^,f)| = rc(f). A cover 
for U for size ft(T) is constructed as follows. Let = {Sj \ xj G Y{y^ T)}. By definition, 
1^1 = k(T). We now claim that it is a cover for U. Assume, towards contradiction, that 
there exists some U{ G U not covere d by Consider the graph G f = G \ {e^} where 
ei = Recall that by Obs. |5.2| (a), P& = P[s,pk-i] o Q fc is the 5 — ^ path in 

G \ {e/e}. Note that P& ^ G' for every k > i and |P^| > |P^| for every k < i. Hence 
denoting the set of neighbors of Zi in X by T(zi) = | (z^Xj) G Pxz}, by Obs. 
|5.2[ b), the unique s — Xj shortest-path, for every Xj G T(zi) such that (zi,Xj) G E X y^ 
is given by Pj = P^ o Therefore the s — yi shortest-paths in G' are all given by 

P- o (xj, ^), for every Xj G T(^). But since (xj,yi) £ T for every xj G T(^), we have that 
dist(s, y^ G f ) < dist(s, y^ T \ {e^}), in contradiction to the fact that T G T(s, G). I 



Lemma 5.5 If there exists a Set-Cover of size k then Cost*(s, G) < \E\ + k • P. 

Proof: Given a cover C |#'| = construct a FT-BFS tree T G T(s, G) with |P|+^-P 
edges as follows. Add P to T. In addition, for every Sj G add the edge {y^Xj) to T 
for every ^ G Y. Clearly, \E(f)\ = \E\ + k • R. It remains to show that f G T(s,G). 
Note that there is no s — u replacement path that uses any yg G Y as a relay, for any 
^ G V(G) and ^ G Y"; this holds as X is connected by two alternative shortest-paths to 
both Pn+i and to v' and the path through yg is strictly longer. In addition, if the edge 
e G {{pniPn+i), fails, then the s — yt shortest path in G \ {e} goes through 

any neighbor xj of yg. Since each yg has at least one X node neighbor in T, it holds that 
dist(s, y i: T \ {e}) = dist(s, ^, G \ {e}). 

Since the only missing edges of T, namely, P(G)\P(T), are the edges of P^y, it follows 
that it remains to check the edges = ^) for every i G {1, . . . , A}. Let 6j G such 

that G iSj. Since is a cover, such Sj exists. Hence, the optimal s — yg replacement 



path in G \ {e^}, which is by Obs. 5.2(b), P r = Pi o {z^ Xj) o (xj,y£), exists in T \ {e^} for 
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every y £ G Y. It follows that T G T(s, G), hence Cost*(s, G) < \E(T)\ = \E\ + k-R. The 
lemma follows. | 

Let k* be the cost of the optimal Set-Cover for the instance (U,$). We have the 
following. 

Corollary 5.6 Cost*(s,G) = \E\ + k* > R. 

Proof: Let T* G T(s, G) be such that \E(T*)\ = Cost*(s, G). It then holds that 

\E\ + k(T*) • R < \E(T*)\ = Cost*(s, G) < \E\ + k* • R, 
where the first inequality holds by Eq. ([7| and the second inequality follows by Lemma 



5.5, Hence, k(T*) < k*. Since by Lemma [5T4| there exists a cover of size k(T*), we have 
that k* < k(T*). It follows that k* = k(T*) and Cost*(<s,G) = \E\ + k* • R as desired. 
I 

We now show that the reduction is gap-preserving. Assume that there exists an 
a approximation algorithm A for the Minimum FT-BFS problem. Then applying our 
transformation to an instance X{U ) $) = (G, s) would result in an FT-BFS tree T G T(s, G) 
such that 

|^| + k(T) • R < \E(f)\ < a(\E\ +K*-R)<3a-K*-R, 



where the first ineq 
of A a nd b y Cor 
Lemma 



5.4 



uality follows by Eq. ([7]), the second by the approximation guarantee 
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and the third inequality follows by the fact that \E\ < 2R. By 
a cover of size k(T) < 3a k* can be constructed given T, which results in a 3a 
approximation to the Set-Cover instance. As the Set-Cover problem is inapproximable 
within a factor of (1 — o(l))lnn, under an appropriate complexity assumption [15] . we 
get that the Minimum FT-BFS problem is inapproximable within a factor of c • log for 
some constant c > 0. This complete the proof of Thm. |5.1[ 



6 O (log n)- Approximation for FT-MBFS Trees 



In Sec. |4.1[ we presented an algorithm that for every g raph G and source s constructs 

we showed that there exist 



3.1 



an FT-BFS tree T G T(s, G) with 0(n 3 / 2 ) edges. In Sec. 
graphs G and s G V(G) for which Cost*(s,G) = f](n 3 / 2 ), establishing tightness of our 
algorithm in the worst-case. Yet, there are also inputs (G 7 , s') for which the algorithm of 
Sec. [ij as well as algorithms based on the analysis of [14] and [23] . might still produce an 
FT-BFS T G T(s' , G f ) which is denser by a factor of Q(^/n) than the size of the optimal 
FT-BFS tree, i.e., such that \E{T)\ > Q(y/n) • Cost*(5 / , G'). For an illustration of such a 
case consider the graph G f = (V, E) which is a modification of the graph G described in 
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Sec. |3.1| The modifications are as follows. First, add a node z to Z and connect it to 
every X{ £ X. Replace the last edge = LastE(i^) of the V{ — Zi path Pi by a vertex 
that is connected to the endpoints of the edge e[ for every i £ {1, . . . , d}. Let P[ be the 
s — Zi modified path where LastE(P/) = (r^, Zi). Finally, connect the node z to all nodes 
Ti for every i £ {1, . . . , d}. See Fig. [5] for illustration. 



d = 0(Jn) 




Figure 5: Bad example for the algorithm of Sec. |4| The weights of the z edges are larger 
than those of the other edges. Thus, the entire complete bipartite graph £>(X, Z \{z }) 
of size fi(n 3 / 2 ) is included in the resulting FT-BFS tree T £ T(s, G) returned by the 
algorithm. However, an FT-BFS tree T* of 0(n) edges can be given by including the edges 
of (zo,Xi) for every Xi £ X. The red edges are two optional edges necessary upon the 
failure of e$. Adding the edge (xj,z ) is better, yet the algorithm of Sec. [3] adds (xj,Zi) 
to T for every xj £ X. 



Observe that whereas Cost* (5, G) = Q(n 3/2 ), the modified G r has Cost* (5, G') = O(n), 
as the edges of the complete bipartite graph B that are required in every T £ 7~(s, G) are 
no longer required in every T' £ T(s, G'); it is sufficient to connect the nodes of X to z 
only, and by that "save" the Q(n 3 / 2 ) edges of B in T' . Nevertheless, as we show next, for 
certain weight assignments the algorithm of Sec. [i] constructs an FT-BFS tree T of size 
0(n 3 / 2 ). Specifically, let W be such that each of the edges of 

E' = {(z ,r t ) I ie{l,...,d}}U{(zo,Xi) | x % £ X} 

is assigned a weight which is strictly larger than the weights of the other edges. That 
is, W(ek) > W(ei) for every e*. £ E' and £ E{G') \ E' . Note that for every edge 
ei = (vi,Vi+i) £ 7r, i £ {1, . . . there are two alternative s — xj replacement paths 
of the same length, namely, Qij = 7r[s,^] o P! o (z^Xj) that goes through z^ and Q ij<7 - = 
7r[s, ^i] P/[s, o (r^, zq) (^0? #i) that goes through z . Although |Qij| = |Qij|, the weight 
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assignment implies that Cost(Q^) < Cost(Q^) and hence Q ij<7 - ^ SP(s,xj,G \ {e^^W) 
for every z G and every Xj G X. Therefore, E(B) C T, for every FT-BFS 

tree T computed by the algorithm of Sec. [I] with the weight assignment W. Hence 
\E(f)\ = 0(n 3 / 2 ) while Cost*(s,G") = 0(n). 

Clearly, a universally optimal algorithm is unlikely given the hardness of approximation 
result of Thm. |5.1[ Yet the gap can be narrowed down. The goal of this section is to 
present an O(logn) approximation algorithm for the Minimum FT-BFS Problem (hence 
also to its special case, the Minimum FT-BFS Problem, where \S\ = 1). 

To establish this result, we first describe the algorithm and then bound the number of 
edges. Let ApproxSetCover(#, U) be an O(logn) approximation algorithm for the Set- 
Cover problem, which given a collection of sets # = {Si, • . . , Sm} that covers a universe 
U = {ui, . . . ,Un} of size AT, returns a cover C $ that is larger by at most O(logTV) 
than any other C $ that covers U (cf. [27]). 

The Algorithm Starting with T = 0, the algorithm adds edges to T until it becomes 
an FT-MBFS tree. 

Set an arbitrary order on the vertices V(G) = {^i, . . . ,v n } and on the edges E + = 
E(G) U {eo} = {^o, • • • , e m } where e is a new fictitious edge whose role will be explained 
later on. For every node V{ G V ) define 

Ui = {(s kl ej) | s k G S \ {vi} : ej G E + }. 

The algorithm consists of n rounds, where in round i it considers Let Y{v^G) = 
{ui, . . . , Udt} be the set of neighbors of V{ in some arbitrary order, where di = deg(^, G). 
For every neighbor define a set Sij C Ui containing certain source-edge pairs (s k) eg) G 
Ui. Informally, a set Sij contains the pair (s^, eg) iff there exists an s k — Vi shortest path in 
G \ {eg} that goes through the neighbor Uj of V{. Note that Sij contains the pair (sk, eo) 
iff there exists an s k — Vi shortest-path in G \ {e } = G that goes through Uj. I.e., the 
fictitious edge e is meant to capture the case where no fault occurs, and thus we take 
care of true shortest-paths in G. Formally, every pair (sk, eg) G Ui is included in every set 
Sij satisfying that 

dist(sfc, uj, G \ {eg}) = dist(s fej v h G \ {eg}) - 1. (8) 

Let $i = {S^i, . . . , Siji}- The edges of Vi that are added to T in round i are now selected 
by using algorithm ApproxSetCover to generate an approximate solution for the set cover 
problem on the collection $ = {Sij | uj G Y(vi, G)}. Let = ApproxSetCover^, Ui). 
For every Sij G 3^, add the edge (uj,Vi) to T. We now turn to prove the correctness of 
this algorithm and establish Thm. |6.5[ 
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Analysis We first show that algorithm constructs an FT-MBFS T G T{S ) G) and then 
bound its size. 

Lemma 6.1 f eT(S,G). 

Proof: Assume, towards contradiction, that T £ T{S ) G). Let s G S be some source 
node such that T £ T(s, G) is not an FT-BFS tree with respect to s. By the assumption, 
such s exists. Let 

BP = {(i, fc) | Vi G V, e fc e E + and dist(s, ^, f \ {e k }) > dist(s, ^, G \ {e fc })} 

be the set of "bad pairs," namely, vertex-edge pairs (i, fc) for which the s — Vi shortest path 
distance in T\{e&} is greater than that in G\{e^}- (By the assumption that T £ T(s, G), 
it holds that BP ^ 0.) For every vertex-edge pair (i, fc), where ^ G V \ {5} and G P + , 
define an s — V{ shortest-path P* k in G \ {e/J in the following manner. Let uj G T(vi, G) 
be such that the pair (s, e&) G 5^ is covered by the set S^j of and S^- G ^ is included 
in the cover returned by the algorithm ApproxSetCover in round i. Thus, (uj,Vj) G T 
and dist(s, G \ {ek}) = dist(s, G \ {ek}) — 1. Let P' G SP(s, G \ {ek}) and define 

i^ = P'o(^). 

By definition, \P* k \ = dist(s,^,G \ {ek}) and by construction, LastE(P i * fc ) G T. Define 
PP(i, fc) = P*k \ E(T) to be the set of "bad edges," namely, the set of P* k edges that 
are missing in T. By definition, PP(i, k) ^ for every bad pair (i, fc) G BP. Let 
fc) = max eG ££(^){dist(«s, e, P*/.)} be the maximal depth of a missing edge in PP(i, fc), 
and let DM{i,k) denote that "deepest missing edge" for (i, fc), i.e., the edge e on P*^ 
satisfying d{i,k) = dist(s, e, P* k ). Finally, let (i',k f ) G BP be the pair that minimizes 
d{i ) k) ) and let ei = (v^^v^) G BE(i f ,k') be the deepest missing edge on P*/ k /, namely, 
e\ = DM(i r ,k r ). Note that e\ is the shallowest "deepest missing edge" over all bad 
pairs G PP. Let Pi = i^* jjfe ,, P2 = ^^[s,^] and P 3 = P^/^u, ?v]; see Fig. [6]for 
illustration. Note that since (Y, k') G PP, it follows that also A; 7 ) G PP. (Otherwise, if 
(ii, fc') ^ PP, then any 5— v ix shortest-path P' G SP(s, v ix ^ T\{e^}) , where |P'| = \P* fc/ |, 
can be appended to P 3 resulting in P" = P' o P 3 such that (1) P"CT \ {e^/} and (2) 
|P"| = |P'| + |P 3 | = \P 2 \ + |P 3 | = \P^A contradicting the fact that {i 1 \ k') G PP.) Thus 
we conclude that (i\,k') G PP. Finally, note that LastE(Pi) G T by definition, and 
therefore the deepest missing edge of (i, fc) must be shallower, i.e., < d(i / ) k / ). 

However, this is in contradiction to our choice of the pair (i 1 \ k f ). The lemma follows. | 

Let W : E(G) — » M>o be the weight assignment that guarantees the uniqueness 
of shortest-paths. Note that the algorithm did not use W in the computation of the 
shortest-paths. For every node let T(^, G) = {u\, . . . , u^} be its ordered neighbor set 
as considered by the algorithm. For every FT-MBFS tree T G T(S, G), vi G V, G P + and 
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Figure 6: Red solid lines correspond to new edges. The "deepest missing edge" for (z 7 , /c 7 ), 
edge ei, is the shallowest such edge over all bad pairs in BP. Yet the pair k') is bad 
too. As the last (green) edge of Pi is included in the FT-MBFS tree, and since Pi and P 2 
are of the same length, it follows that Pi has a shallower "deepest missing edge" . 

Sk G S, let Pi{sk ) eg) G SP(sk, v i) T \ {q}, W) be an sj~ — Vi shortest-path in T \ {q}. Let 

A i (f) = {LastE(Pi(s jb ,e i )) | 

be the edges of Vi that appear as last edges in the shortest-paths and replacement paths 
from S to vi in T. Define 

UT) = {S itj | (i^) e MT)}- 

We then have that 

\un = \Mn • (9) 
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The correctness of the algorithm (see Lemma 6.1) established that if a subgraph T C G 
satisfies that &(T) is a cover of Ui for every vi G V, then T G G). We now turn to 
show the reverse direction. 

Lemma 6.2 For every T G T(S,G), the collection 3*(T) is a co^er of Ui, namely, 

Proof: Assume, towards contradiction, that there exists an FT-MBFS tree T G T{S ) G) 
and a vertex V{ G V whose corresponding collection of sets &(T) does not cover Ui. Hence 
there exists at least one uncovered pair (s*., eg) G t^, i.e., 

(sk^t)eUi\ |J S^-. (10) 

By definition ^ Vi. We next claim that T does not contain an optimal <s& — Vi path 
when the edge eg fails, contradicting the fact that T G T(/S, G). That is, we show that 

dist(sfc, v h T \ {e £ }) > dist(s fej v u G \ {e £ }). 

Towards contradiction, assume otherwise, and let (uj,Vi) = LastE(P^) where P* £ G 
SP(sk,Vi,T \ {e^},Vl^), hence (uj,Vi) G Ai(T) and Sij G &(T). By the contradictory 
assumption, |P* £ | = dist(«s/ c , G \ {eg}) and hence dist(sfc, G \ {eg}) = dist(sfc, G \ 



{eg}) — 1. This implies that (s^e^) G S^j G ^(T), in contradiction to Eq. (10), stating 
that (sfc, eg) is not covered by &(T). The lemma follows. | 

We now turn to bound that number of edges in T. 
Lemma 6.3 \E(f)\ < O(logrc) > Cost* (S,G). 

Proof: Let S = c log n be the approximation ratio guarantee of ApproxSetCover. For ease 
of notation, let Oi = Ai(T*) for every Vi G V. Let = {Si,i, . . . , S^.} be the collection 
of ^ sets considered at round i where Sij C is the set of the neighbor i^- G T(^, G) 
computed according to Eq. ([8]). 

Let ^ = ApproxSetCover Ui) be the cover returned by the algorithm and define 
Ai = {(uj, | Si j G 3^} as the collection of edges whose corresponding sets are included 
in S[. Thus, by Eq. @, \O x \ = |&(T*)| and \A t \ = |$| for every v % G V. 

Observation 6.4 \Ai\ < 5\Oi\ for every Vi G V \ {s}. 

Proof: Assume, towards contradiction, that there exists some i such that \Ai\ > S\Oi\. 
Then by Eq. ^ and by the approximation guarantee of ApproxSetCover where in 
particular |&(T)| < 5\$"\ for every C & that covers it follows that 3*(T*) is not 
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a cover of Ui. Consequently, it follows by Lemma 6.2 that T* ^ T{S ) G) ) contradiction. 
The observation follows. | 



Since [J A{ contains precisely the edges that are added by the algorithm to the con- 
structed FT-MBFS tree T, we have that 

\E(T)\ < ^2\Ai\<6^2\Oi\<26'Co3t*(S,G) , 



where the second inequality follows by Obs. 6.4 and the third by the fact that \E(T*)\ > 



^ \Oi\/2 (as every edge in U^ev ^ i can ^ e coun ted at most twice, by both its endpoints). 
The lemma follows. | 

The following theorem is established. 

Theorem 6.5 There exists a polynomial time algorithm that for every n-vertex graph G 
and source node set S C V constructs an FT-MBFS tree T G T(S, G) such that \E(T)\ < 
O(logrc) - Cost* (5, G). 
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